Playback Device Calibration Based on Representative Spectral Characteristics

ABSTRACT

A computing device may maintain a database of representative spectral characteristics. The computing device may also receive particular spectral data associated with a particular playback environment corresponding to the particular playback device. Based on the particular spectral data, the computing device may identify one of the representative spectral characteristics from the database that substantially matches the particular spectral data, and then identify, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device. The computing device may then transmit, to the particular playback device, data indicating the identified audio processing algorithm.

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of U.S. application Ser. No. 15/089,004 filed Apr. 1, 2016, the disclosure of which is explicitly incorporated by reference herein in its entirety. The present application also incorporates by reference U.S. patent application Ser. No. 14/481,505, filed on Sep. 9, 2014, for “Audio Processing Algorithms;” U.S. patent application Ser. No. 14/481,511, filed on Sep. 9, 2014, for “Playback Device Calibration;” U.S. patent application Ser. No. 14/805,140, filed on Jul. 21, 2015, for “Hybrid Test Tone for Space-Averaged Room Audio Calibration Using a Moving Microphone;” and U.S. patent application Ser. No. 15/088,994, filed on Apr. 1, 2016, for “Updating Playback Device Configuration Information Based on Calibration Data.”

FIELD OF THE DISCLOSURE

The disclosure is related to consumer goods and, more particularly, to methods, systems, products, features, services, and other elements directed to media playback or some aspect thereof.

BACKGROUND

Options for accessing and listening to digital audio in an out-loud setting were limited until in 2003, when SONOS, Inc. filed for one of its first patent applications, entitled “Method for Synchronizing Audio Playback between Multiple Networked Devices,” and began offering a media playback system for sale in 2005. The Sonos Wireless HiFi System enables people to experience music from many sources via one or more networked playback devices. Through a software control application installed on a smartphone, tablet, or computer, one can play what he or she wants in any room that has a networked playback device. Additionally, using the controller, for example, different songs can be streamed to each room with a playback device, rooms can be grouped together for synchronous playback, or the same song can be heard in all rooms synchronously.

Given the ever growing interest in digital media, there continues to be a need to develop consumer-accessible technologies to further enhance the listening experience.

BRIEF DESCRIPTION OF THE DRAWINGS

Features, aspects, and advantages of the presently disclosed technology may be better understood with regard to the following description, appended claims, and accompanying drawings where:

FIG. 1 shows an example media playback system configuration in which certain embodiments may be practiced;

FIG. 2 shows a functional block diagram of an example playback device;

FIG. 3 shows a functional block diagram of an example control device;

FIG. 4 shows an example controller interface;

FIG. 5 shows an example flow diagram for calibrating a particular playback device;

FIG. 6 shows an example playback environment within which a playback device may be calibrated;

FIG. 7 shows an example computing device in communication with an example plurality of playback devices;

FIG. 8 shows an example portion of a database of representative spectral characteristics; and

FIG. 9 shows an example flow diagram for calibrating a playback device.

The drawings are for the purpose of illustrating example embodiments, but it is understood that the inventions are not limited to the arrangements and instrumentality shown in the drawings.

DETAILED DESCRIPTION I. Overview

Examples described herein involve calibrating a playback device within a playback environment based on representative spectral characteristics. Utilizing representative spectral characteristics may facilitate a playback device calibration process that is, as one example, shorter in duration. Other benefits are also possible.

For instance, calibration data may be aggregated by a computing device, such as a server, from the individual calibrations of numerous different playback devices located in numerous different playback environments. From each calibration, a spectral characteristic may be determined corresponding to the audio response of the given playback environment. Each spectral characteristic may represent, for example, the size, shape, furnishing, etc. of the given playback environment (e.g., room) where each playback devices was calibrated. The computing device may gather data relating to playback device characteristics from each calibration as well, such as the device model, hardware and/or software versions, zone configurations, and other manual settings corresponding to each individual playback device that is calibrated.

From the aggregated calibration data, the determined spectral characteristics may be grouped, mathematically and/or otherwise, into a substantially smaller number of representative spectral characteristics that may capture the entire data set within a reasonable margin of error. For instance, thousands or perhaps even millions of individual calibrations from households and playback environments around the world may yield a similar number of spectral characteristics. In some cases, there may be substantial commonality and/or overlap in the distribution of spectral characteristics across all calibrated playback devices and playback environments.

Accordingly, the computing device may analyze the calibration data and determine, for instance, 50 representative spectral characteristics that may represent, within a reasonable margin of error, the scope of potential playback environments in which a playback device may be calibrated. The representative spectral characteristics may be maintained in a database on the computing device, and may be used to simplify additional calibrations that may be performed.

For example, some of the calibrations discussed above that are used as a basis for determining the representative spectral characteristics may involve recording, via a microphone, the playback of audio data from the playback device to be calibrated. The recorded audio data may need to be of a minimum duration, such as 45 seconds, and may need to include a minimum range of frequencies, in order for the computing device to have enough spectral data to determine the spectral characteristic corresponding to the playback environment being calibrated. In some cases, the calibration process may require the microphone to be moved between different spatial locations within the playback environment while the calibration audio is recorded.

Alternatively, the representative spectral characteristics discussed above may be used by the computing device to approximate, with reasonable accuracy, the audio response of the playback environment in question. This may reduce the amount of spectral data that is required, and may therefore simplify the calibration procedure in one or more ways. For example, the computing device may receive spectral data corresponding to only 10 seconds of recorded audio playback. Nonetheless, based on this sample, the computing device may identify one of the representative spectral characteristics that substantially matches the received spectral data. In this way, the duration of audio playback portion the calibration procedure may be shortened.

The requirement for less spatial data to perform a calibration of a given playback device may result in other simplifications as well. For instance, it may allow the microphone to record audio data other than a calibration tone, which may have pre-determined frequency sweeps. Instead, the calibration may be based on the recorded playback of media content having a narrower spectral range. In this way, a calibration might occur during the normal course of enjoying audio playback by the playback device. As another example, the calibration procedure may require less, or perhaps no movement of the microphone throughout the playback environment while the calibration audio is recorded. Other possibilities also exist.

Once the computing device has identified a representative spectral characteristic in the database, it may identify an audio processing algorithm in the database based on the identified representative spectral characteristic and at least one playback device characteristic. For instance, the database may include an entry for an audio processing algorithm that corresponds to the identified representative spectral characteristic in conjunction with a particular model of playback device. Other playback device characteristics may also be used.

The computing device may transmit the identified audio processing algorithm to the playback device. The playback device may then apply the audio processing algorithm, resulting in the playback of audio content that is calibrated to the particular audio characteristics of its playback environment.

In some cases, the use of representative spectral characteristics may facilitate a playback device performing a self-calibration within a playback environment, according to some of the examples discussed above. For example, the playback device may include a microphone, which may be used to record the playback device's own playback of audio data. The playback device may then transmit the recorded audio playback data to the computing device.

Because the microphone in this example is stationary, the recorded audio playback data may include less spectral data then might otherwise be required to accurately determine the audio response of the playback environment “from scratch.” However, as discussed above, the computing device may identify a representative spectral characteristic from the database that substantially matches the spectral data that is received.

The playback device may then receive, from the computing device, data indicating an audio processing algorithm corresponding to the playback environment. The playback device may then apply the audio processing algorithm while playing back further audio content, resulting in audio playback that is calibrated to the playback environment.

As indicated above, the examples involve calibrating a playback device based on representative spectral characteristic data. In one aspect, a method is provided. The method involves maintaining, by a computing device, a database of representative spectral characteristics and receiving, by the computing device, particular spectral data associated with a particular playback environment corresponding to the particular playback device. The method also involves, based on the particular spectral data, identifying one of the representative spectral characteristics from the database that substantially matches the particular spectral data and identifying, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device. Further, the method also involves transmitting, to the particular playback device, data indicating the identified audio processing algorithm.

In another aspect, a computing device is provided. The device includes a processor, a non-transitory computer readable medium, and program instructions stored on the non-transitory computer readable medium that, when executed by the processor, cause the computing device to perform functions. The functions include maintaining a database of representative spectral characteristics and receiving particular spectral data associated with a particular playback environment corresponding to a particular playback device. The functions also include, based on the particular spectral data, identifying one of the representative spectral characteristics from the database that substantially matches the particular spectral data and identifying, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device. Further, the functions include transmitting, to the particular playback device, data indicating the identified audio processing algorithm.

In yet another aspect, a playback device is provided. The device includes a processor, a microphone, a non-transitory computer readable medium, and program instructions stored on the non-transitory computer readable medium that, when executed by the processor, cause the playback device to perform functions. The functions include playing back first audio content in a playback environment and recording, via the microphone, at least a portion of the played back first audio content. The functions also include transmitting the recorded audio content to a computing device and receiving, from the computing device, data indicating an audio processing algorithm corresponding to the playback environment. Further, the functions include applying the audio processing algorithm while playing back second audio content in the playback environment.

It will be understood by one of ordinary skill in the art that this disclosure includes numerous other embodiments. It will be understood by one of ordinary skill in the art that this disclosure includes numerous other examples. While some examples described herein may refer to functions performed by given actors such as “users” and/or other entities, it should be understood that this description is for purposes of explanation only. The claims should not be interpreted to require action by any such example actor unless explicitly required by the language of the claims themselves.

While some examples described herein may refer to functions performed by given actors such as “users” and/or other entities, it should be understood that this is for purposes of explanation only. The claims should not be interpreted to require action by any such example actor unless explicitly required by the language of the claims themselves.

II. Example Operating Environment

FIG. 1 shows an example configuration of a media playback system 100 in which one or more embodiments disclosed herein may be practiced or implemented. The media playback system 100 as shown is associated with an example home environment having several rooms and spaces, such as for example, a master bedroom, an office, a dining room, and a living room. As shown in the example of FIG. 1, the media playback system 100 includes playback devices 102-124, control devices 126 and 128, and a wired or wireless network router 130.

Further discussions relating to the different components of the example media playback system 100 and how the different components may interact to provide a user with a media experience may be found in the following sections. While discussions herein may generally refer to the example media playback system 100, technologies described herein are not limited to applications within, among other things, the home environment as shown in FIG. 1. For instance, the technologies described herein may be useful in environments where multi-zone audio may be desired, such as, for example, a commercial setting like a restaurant, mall or airport, a vehicle like a sports utility vehicle (SUV), bus or car, a ship or boat, an airplane, and so on.

a. Example Playback Devices

FIG. 2 shows a functional block diagram of an example playback device 200 that may be configured to be one or more of the playback devices 102-124 of the media playback system 100 of FIG. 1. The playback device 200 may include a processor 202, software components 204, memory 206, audio processing components 208, audio amplifier(s) 210, speaker(s) 212, and a network interface 214 including wireless interface(s) 216 and wired interface(s) 218. In one case, the playback device 200 might not include the speaker(s) 212, but rather a speaker interface for connecting the playback device 200 to external speakers. In another case, the playback device 200 may include neither the speaker(s) 212 nor the audio amplifier(s) 210, but rather an audio interface for connecting the playback device 200 to an external audio amplifier or audio-visual receiver.

In one example, the processor 202 may be a clock-driven computing component configured to process input data according to instructions stored in the memory 206. The memory 206 may be a tangible computer-readable medium configured to store instructions executable by the processor 202. For instance, the memory 206 may be data storage that can be loaded with one or more of the software components 204 executable by the processor 202 to achieve certain functions. In one example, the functions may involve the playback device 200 retrieving audio data from an audio source or another playback device. In another example, the functions may involve the playback device 200 sending audio data to another device or playback device on a network. In yet another example, the functions may involve pairing of the playback device 200 with one or more playback devices to create a multi-channel audio environment.

Certain functions may involve the playback device 200 synchronizing playback of audio content with one or more other playback devices. During synchronous playback, a listener will preferably not be able to perceive time-delay differences between playback of the audio content by the playback device 200 and the one or more other playback devices. U.S. Pat. No. 8,234,395 entitled, “System and method for synchronizing operations among a plurality of independently clocked digital data processing devices,” which is hereby incorporated by reference, provides in more detail some examples for audio playback synchronization among playback devices.

The memory 206 may further be configured to store data associated with the playback device 200, such as one or more zones and/or zone groups the playback device 200 is a part of, audio sources accessible by the playback device 200, or a playback queue that the playback device 200 (or some other playback device) may be associated with. The data may be stored as one or more state variables that are periodically updated and used to describe the state of the playback device 200. The memory 206 may also include the data associated with the state of the other devices of the media system, and shared from time to time among the devices so that one or more of the devices have the most recent data associated with the system. Other embodiments are also possible.

The audio processing components 208 may include one or more digital-to-analog converters (DAC), an audio preprocessing component, an audio enhancement component or a digital signal processor (DSP), and so on. In one embodiment, one or more of the audio processing components 208 may be a subcomponent of the processor 202. In one example, audio content may be processed and/or intentionally altered by the audio processing components 208 to produce audio signals. The produced audio signals may then be provided to the audio amplifier(s) 210 for amplification and playback through speaker(s) 212. Particularly, the audio amplifier(s) 210 may include devices configured to amplify audio signals to a level for driving one or more of the speakers 212. The speaker(s) 212 may include an individual transducer (e.g., a “driver”) or a complete speaker system involving an enclosure with one or more drivers. A particular driver of the speaker(s) 212 may include, for example, a subwoofer (e.g., for low frequencies), a mid-range driver (e.g., for middle frequencies), and/or a tweeter (e.g., for high frequencies). In some cases, each transducer in the one or more speakers 212 may be driven by an individual corresponding audio amplifier of the audio amplifier(s) 210. In addition to producing analog signals for playback by the playback device 200, the audio processing components 208 may be configured to process audio content to be sent to one or more other playback devices for playback.

Audio content to be processed and/or played back by the playback device 200 may be received from an external source, such as via an audio line-in input connection (e.g., an auto-detecting 3.5 mm audio line-in connection) or the network interface 214.

The network interface 214 may be configured to facilitate a data flow between the playback device 200 and one or more other devices on a data network. As such, the playback device 200 may be configured to receive audio content over the data network from one or more other playback devices in communication with the playback device 200, network devices within a local area network, or audio content sources over a wide area network such as the Internet. In one example, the audio content and other signals transmitted and received by the playback device 200 may be transmitted in the form of digital packet data containing an Internet Protocol (IP)-based source address and IP-based destination addresses. In such a case, the network interface 214 may be configured to parse the digital packet data such that the data destined for the playback device 200 is properly received and processed by the playback device 200.

As shown, the network interface 214 may include wireless interface(s) 216 and wired interface(s) 218. The wireless interface(s) 216 may provide network interface functions for the playback device 200 to wirelessly communicate with other devices (e.g., other playback device(s), speaker(s), receiver(s), network device(s), control device(s) within a data network the playback device 200 is associated with) in accordance with a communication protocol (e.g., any wireless standard including IEEE 802.11a, 802.11b, 802.11g, 802.11n, 802.11ac, 802.15, 4G mobile communication standard, and so on). The wired interface(s) 218 may provide network interface functions for the playback device 200 to communicate over a wired connection with other devices in accordance with a communication protocol (e.g., IEEE 802.3). While the network interface 214 shown in FIG. 2 includes both wireless interface(s) 216 and wired interface(s) 218, the network interface 214 may in some embodiments include only wireless interface(s) or only wired interface(s).

The playback device 200 may also include one or more microphones 220. The microphone(s) 220 may be used to detect audio data in proximity to the playback device 200, such as voice commands for controlling the playback device 200. Further, the microphone(s) 220 may be used to capture and record audio playback data from the playback device 200, or from one or more other playback devices in proximity to the playback device 200, during a calibration procedure. Other examples and other uses for the microphone(s) 220 are also possible.

In one example, the playback device 200 and one other playback device may be paired to play two separate audio components of audio content. For instance, playback device 200 may be configured to play a left channel audio component, while the other playback device may be configured to play a right channel audio component, thereby producing or enhancing a stereo effect of the audio content. The paired playback devices (also referred to as “bonded playback devices”) may further play audio content in synchrony with other playback devices.

In another example, the playback device 200 may be sonically consolidated with one or more other playback devices to form a single, consolidated playback device. A consolidated playback device may be configured to process and reproduce sound differently than an unconsolidated playback device or playback devices that are paired, because a consolidated playback device may have additional speaker drivers through which audio content may be rendered. For instance, if the playback device 200 is a playback device designed to render low frequency range audio content (i.e. a subwoofer), the playback device 200 may be consolidated with a playback device designed to render full frequency range audio content. In such a case, the full frequency range playback device, when consolidated with the low frequency playback device 200, may be configured to render only the mid and high frequency components of audio content, while the low frequency range playback device 200 renders the low frequency component of the audio content. The consolidated playback device may further be paired with a single playback device or yet another consolidated playback device.

By way of illustration, SONOS, Inc. presently offers (or has offered) for sale certain playback devices including a “PLAY:1,” “PLAY:3,” “PLAY:5,” “PLAYBAR,” “CONNECT:AMP,” “CONNECT,” and “SUB.” Any other past, present, and/or future playback devices may additionally or alternatively be used to implement the playback devices of example embodiments disclosed herein. Additionally, it is understood that a playback device is not limited to the example illustrated in FIG. 2 or to the SONOS product offerings. For example, a playback device may include a wired or wireless headphone. In another example, a playback device may include or interact with a docking station for personal mobile media playback devices. In yet another example, a playback device may be integral to another device or component such as a television, a lighting fixture, or some other device for indoor or outdoor use.

b. Example Playback Zone Configurations

Referring back to the media playback system 100 of FIG. 1, the environment may have one or more playback zones, each with one or more playback devices. The media playback system 100 may be established with one or more playback zones, after which one or more zones may be added, or removed to arrive at the example configuration shown in FIG. 1. Each zone may be given a name according to a different room or space such as an office, bathroom, master bedroom, bedroom, kitchen, dining room, living room, and/or balcony. In one case, a single playback zone may include multiple rooms or spaces. In another case, a single room or space may include multiple playback zones.

As shown in FIG. 1, the balcony, dining room, kitchen, bathroom, office, and bedroom zones each have one playback device, while the living room and master bedroom zones each have multiple playback devices. In the living room zone, playback devices 104, 106, 108, and 110 may be configured to play audio content in synchrony as individual playback devices, as one or more bonded playback devices, as one or more consolidated playback devices, or any combination thereof. Similarly, in the case of the master bedroom, playback devices 122 and 124 may be configured to play audio content in synchrony as individual playback devices, as a bonded playback device, or as a consolidated playback device.

In one example, one or more playback zones in the environment of FIG. 1 may each be playing different audio content. For instance, the user may be grilling in the balcony zone and listening to hip hop music being played by the playback device 102 while another user may be preparing food in the kitchen zone and listening to classical music being played by the playback device 114. In another example, a playback zone may play the same audio content in synchrony with another playback zone. For instance, the user may be in the office zone where the playback device 118 is playing the same rock music that is being playing by playback device 102 in the balcony zone. In such a case, playback devices 102 and 118 may be playing the rock music in synchrony such that the user may seamlessly (or at least substantially seamlessly) enjoy the audio content that is being played out-loud while moving between different playback zones. Synchronization among playback zones may be achieved in a manner similar to that of synchronization among playback devices, as described in previously referenced U.S. Pat. No. 8,234,395.

As suggested above, the zone configurations of the media playback system 100 may be dynamically modified, and in some embodiments, the media playback system 100 supports numerous configurations. For instance, if a user physically moves one or more playback devices to or from a zone, the media playback system 100 may be reconfigured to accommodate the change(s). For instance, if the user physically moves the playback device 102 from the balcony zone to the office zone, the office zone may now include both the playback device 118 and the playback device 102. The playback device 102 may be paired or grouped with the office zone and/or renamed if so desired via a control device such as the control devices 126 and 128. On the other hand, if the one or more playback devices are moved to a particular area in the home environment that is not already a playback zone, a new playback zone may be created for the particular area.

Further, different playback zones of the media playback system 100 may be dynamically combined into zone groups or split up into individual playback zones. For instance, the dining room zone and the kitchen zone 114 may be combined into a zone group for a dinner party such that playback devices 112 and 114 may render audio content in synchrony. On the other hand, the living room zone may be split into a television zone including playback device 104, and a listening zone including playback devices 106, 108, and 110, if the user wishes to listen to music in the living room space while another user wishes to watch television.

c. Example Control Devices

FIG. 3 shows a functional block diagram of an example control device 300 that may be configured to be one or both of the control devices 126 and 128 of the media playback system 100. As shown, the control device 300 may include a processor 302, memory 304, a network interface 306, and a user interface 308. In one example, the control device 300 may be a dedicated controller for the media playback system 100. In another example, the control device 300 may be a network device on which media playback system controller application software may be installed, such as for example, an iPhone™, iPad™ or any other smart phone, tablet or network device (e.g., a networked computer such as a PC or Mac™).

The processor 302 may be configured to perform functions relevant to facilitating user access, control, and configuration of the media playback system 100. The memory 304 may be configured to store instructions executable by the processor 302 to perform those functions. The memory 304 may also be configured to store the media playback system controller application software and other data associated with the media playback system 100 and the user.

In one example, the network interface 306 may be based on an industry standard (e.g., infrared, radio, wired standards including IEEE 802.3, wireless standards including IEEE 802.11a, 802.11b, 802.11g, 802.11n, 802.11ac, 802.15, 4G mobile communication standard, and so on). The network interface 306 may provide a means for the control device 300 to communicate with other devices in the media playback system 100. In one example, data and information (e.g., such as a state variable) may be communicated between control device 300 and other devices via the network interface 306. For instance, playback zone and zone group configurations in the media playback system 100 may be received by the control device 300 from a playback device or another network device, or transmitted by the control device 300 to another playback device or network device via the network interface 306. In some cases, the other network device may be another control device.

Playback device control commands such as volume control and audio playback control may also be communicated from the control device 300 to a playback device via the network interface 306. As suggested above, changes to configurations of the media playback system 100 may also be performed by a user using the control device 300. The configuration changes may include adding/removing one or more playback devices to/from a zone, adding/removing one or more zones to/from a zone group, forming a bonded or consolidated player, separating one or more playback devices from a bonded or consolidated player, among others. Accordingly, the control device 300 may sometimes be referred to as a controller, whether the control device 300 is a dedicated controller or a network device on which media playback system controller application software is installed.

The control device 300 may also include one or more microphones 310. The microphone(s) 310 may be used to detect audio data in proximity to the control device 300, such as voice commands for controlling the control device 300. Further, the microphone(s) 310 may be used to capture and record audio playback data from a playback device, such as the playback device 200 shown in FIG. 2, during a calibration procedure of a playback device 200. Other examples and other uses for the microphone(s) 310 are also possible.

The user interface 308 of the control device 300 may be configured to facilitate user access and control of the media playback system 100, by providing a controller interface such as the controller interface 400 shown in FIG. 4. The controller interface 400 includes a playback control region 410, a playback zone region 420, a playback status region 430, a playback queue region 440, and an audio content sources region 450. The user interface 400 as shown is just one example of a user interface that may be provided on a network device such as the control device 300 of FIG. 3 (and/or the control devices 126 and 128 of FIG. 1) and accessed by users to control a media playback system such as the media playback system 100. Other user interfaces of varying formats, styles, and interactive sequences may alternatively be implemented on one or more network devices to provide comparable control access to a media playback system.

The playback control region 410 may include selectable (e.g., by way of touch or by using a cursor) icons to cause playback devices in a selected playback zone or zone group to play or pause, fast forward, rewind, skip to next, skip to previous, enter/exit shuffle mode, enter/exit repeat mode, enter/exit cross fade mode. The playback control region 410 may also include selectable icons to modify equalization settings, and playback volume, among other possibilities.

The playback zone region 420 may include representations of playback zones within the media playback system 100. In some embodiments, the graphical representations of playback zones may be selectable to bring up additional selectable icons to manage or configure the playback zones in the media playback system, such as a creation of bonded zones, creation of zone groups, separation of zone groups, and renaming of zone groups, among other possibilities.

For example, as shown, a “group” icon may be provided within each of the graphical representations of playback zones. The “group” icon provided within a graphical representation of a particular zone may be selectable to bring up options to select one or more other zones in the media playback system to be grouped with the particular zone. Once grouped, playback devices in the zones that have been grouped with the particular zone will be configured to play audio content in synchrony with the playback device(s) in the particular zone. Analogously, a “group” icon may be provided within a graphical representation of a zone group. In this case, the “group” icon may be selectable to bring up options to deselect one or more zones in the zone group to be removed from the zone group. Other interactions and implementations for grouping and ungrouping zones via a user interface such as the user interface 400 are also possible. The representations of playback zones in the playback zone region 420 may be dynamically updated as playback zone or zone group configurations are modified.

The playback status region 430 may include graphical representations of audio content that is presently being played, previously played, or scheduled to play next in the selected playback zone or zone group. The selected playback zone or zone group may be visually distinguished on the user interface, such as within the playback zone region 420 and/or the playback status region 430. The graphical representations may include track title, artist name, album name, album year, track length, and other relevant information that may be useful for the user to know when controlling the media playback system via the user interface 400.

The playback queue region 440 may include graphical representations of audio content in a playback queue associated with the selected playback zone or zone group. In some embodiments, each playback zone or zone group may be associated with a playback queue containing information corresponding to zero or more audio items for playback by the playback zone or zone group. For instance, each audio item in the playback queue may comprise a uniform resource identifier (URI), a uniform resource locator (URL) or some other identifier that may be used by a playback device in the playback zone or zone group to find and/or retrieve the audio item from a local audio content source or a networked audio content source, possibly for playback by the playback device.

In one example, a playlist may be added to a playback queue, in which case information corresponding to each audio item in the playlist may be added to the playback queue. In another example, audio items in a playback queue may be saved as a playlist. In a further example, a playback queue may be empty, or populated but “not in use” when the playback zone or zone group is playing continuously streaming audio content, such as Internet radio that may continue to play until otherwise stopped, rather than discrete audio items that have playback durations. In an alternative embodiment, a playback queue can include Internet radio and/or other streaming audio content items and be “in use” when the playback zone or zone group is playing those items. Other examples are also possible.

When playback zones or zone groups are “grouped” or “ungrouped,” playback queues associated with the affected playback zones or zone groups may be cleared or re-associated. For example, if a first playback zone including a first playback queue is grouped with a second playback zone including a second playback queue, the established zone group may have an associated playback queue that is initially empty, that contains audio items from the first playback queue (such as if the second playback zone was added to the first playback zone), that contains audio items from the second playback queue (such as if the first playback zone was added to the second playback zone), or a combination of audio items from both the first and second playback queues. Subsequently, if the established zone group is ungrouped, the resulting first playback zone may be re-associated with the previous first playback queue, or be associated with a new playback queue that is empty or contains audio items from the playback queue associated with the established zone group before the established zone group was ungrouped. Similarly, the resulting second playback zone may be re-associated with the previous second playback queue, or be associated with a new playback queue that is empty, or contains audio items from the playback queue associated with the established zone group before the established zone group was ungrouped. Other examples are also possible.

Referring back to the user interface 400 of FIG. 4, the graphical representations of audio content in the playback queue region 440 may include track titles, artist names, track lengths, and other relevant information associated with the audio content in the playback queue. In one example, graphical representations of audio content may be selectable to bring up additional selectable icons to manage and/or manipulate the playback queue and/or audio content represented in the playback queue. For instance, a represented audio content may be removed from the playback queue, moved to a different position within the playback queue, or selected to be played immediately, or after any currently playing audio content, among other possibilities. A playback queue associated with a playback zone or zone group may be stored in a memory on one or more playback devices in the playback zone or zone group, on a playback device that is not in the playback zone or zone group, and/or some other designated device.

The audio content sources region 450 may include graphical representations of selectable audio content sources from which audio content may be retrieved and played by the selected playback zone or zone group. Discussions pertaining to audio content sources may be found in the following section.

d. Example Audio Content Sources

As indicated previously, one or more playback devices in a zone or zone group may be configured to retrieve for playback audio content (e.g. according to a corresponding URI or URL for the audio content) from a variety of available audio content sources. In one example, audio content may be retrieved by a playback device directly from a corresponding audio content source (e.g., a line-in connection). In another example, audio content may be provided to a playback device over a network via one or more other playback devices or network devices.

Example audio content sources may include a memory of one or more playback devices in a media playback system such as the media playback system 100 of FIG. 1, local music libraries on one or more network devices (such as a control device, a network-enabled personal computer, or a networked-attached storage (NAS), for example), streaming audio services providing audio content via the Internet (e.g., the cloud), or audio sources connected to the media playback system via a line-in input connection on a playback device or network devise, among other possibilities.

In some embodiments, audio content sources may be regularly added or removed from a media playback system such as the media playback system 100 of FIG. 1. In one example, an indexing of audio items may be performed whenever one or more audio content sources are added, removed or updated. Indexing of audio items may involve scanning for identifiable audio items in all folders/directory shared over a network accessible by playback devices in the media playback system, and generating or updating an audio content database containing metadata (e.g., title, artist, album, track length, among others) and other associated information, such as a URI or URL for each identifiable audio item found. Other examples for managing and maintaining audio content sources may also be possible.

The above discussions relating to playback devices, controller devices, playback zone configurations, and media content sources provide only some examples of operating environments within which functions and methods described below may be implemented. Other operating environments and configurations of media playback systems, playback devices, and network devices not explicitly described herein may also be applicable and suitable for implementation of the functions and methods.

III. Playback Device Calibration

As further discussed in the related application Ser. Nos. 14/481,505, 14/481,511, and 14/805,140, which are incorporated herein by reference, an example calibration of a playback device in a playback environment may generally involve the playback device playing back a first audio signal, which is detected and recorded by a microphone as a second audio signal. For instance, the playback device may be the playback device 200 shown in FIG. 2, and the microphone 310 may be part of a control device 300, such as a smartphone. In other examples, the microphone may be the microphone 220 of the playback device 200, or of another playback device, among other possibilities.

The first audio signal may be a test or calibration tone having a particular spectral profile. Different test tones may be stored and used based on the particular configuration of the playback device 200, an anticipated genre of music to be played, the particular characteristics of the playback environment (e.g., room size), etc. After detecting the second audio signal, the control device 300 may, in some examples, send data indicating the second audio signal to a computing device. The computing device may be a server that determines, based on the data indicating the second audio signal, an audio processing algorithm for use by the playback device 200 in the playback environment, which may be transmitted back to the playback device 200.

The audio processing algorithm determined for the playback device 200 in a given calibration is based on several inputs. First, each calibration of the playback device 200 may generally aim to calibrate the playback device 200 according to a target frequency response curve. The target frequency response curve may correspond to a frequency response that is considered good-sounding, or pleasing to a typical listener. The target frequency response curve may vary based on the model of playback device 200 (e.g., size), the orientation of the playback device 200 (e.g., vertical or horizontal), or other configurations states of the playback device 200 (e.g., bonded with another playback device). The computing device may store target frequency response curves in a database for each potential configuration, or they may be stored on the playback device 200, the controller device 300, or more than one of the above.

Further, the microphone that detects the first audio signal, whether it is the microphone 310 on the control device, the microphone 220 on the playback device 200, or a microphone on another playback device, has its own audio characteristics. For example, the response of microphone 310 may depend on the type and model of control device 300 used. The microphone response curve may also be stored in the database on the computing device, or elsewhere. With this information, the second audio signal that is detected by the microphone 310 may be normalized considering the known audio characteristics of the microphone 310. Other known adjustments to the second audio signal may also be applied. For instance, in some cases, the control device 300 may be configured to detect the presence of a protective case, and may obtain from the computing device information regarding the effects of the case on the microphone's audio characteristics. Again, this information may be stored in the database on the computing device. Other possibilities also exist.

Additionally, the second audio signal detected by the microphone 310 will reflect a frequency response corresponding to the playback environment where the playback device is located. Unlike the configuration of the playback device 200, the target frequency response curve, microphone response curve, and other device-based inputs, which may be generally known, the frequency response of the playback environment may represent an unknown variable in each calibration. For instance, the frequency response of the playback environment may be based on the size of a given room, its construction materials, the furnishings of the room and their location with in the room, among other factors. Consequently, it may be determined empirically during the calibration using the microphone 310.

Based on at least some of the calibration inputs discussed above, the computing device may determine an audio processing algorithm that, when transmitted to and applied by the playback device 200 in the playback environment, will cause audio playback by the playback device 200 to approach or meet the target frequency response curve. More or fewer calibration inputs than those described above may be used. However, to obtain a playback device calibration that is specific to the playback environment, some calibration procedures may require a minimum amount of recorded spectral data from the playback environment for the computing device to determine an accurate response curve for the playback environment. For instance, some calibrations may require 45 seconds of recorded playback of the test tone, which may proceed through a designated series of spectral sweeps. Further, the calibration may require recorded spectral data from multiple locations in the playback environment, necessitating movement of the microphone about the environment by a user, for instance.

FIG. 6 illustrates an example playback environment 600 including a control device 602, a playback device 604, and a playback device 606. The control device 602, which may be coordinating and/or performing at least a portion of the calibration, may be similar to the control device 300 of FIG. 3. The playback devices 604 and 606 may both be similar to the playback device 200 of FIG. 2, and one or both may be calibrated according to the examples discussed herein. FIG. 6 also illustrates an example path 608 from a first location (a) to a second location (b) within the playback environment 600, which may represent the movement of the control device 602 during a calibration.

FIG. 6 also shows a computing device 610, which may collect, store, and transmit the calibration information described herein. The computing device 610 may be a server in communication with a media playback system that includes the playback devices 604 and 606. The computing device 610 may also be in communication, either directly or indirectly, with the control device 602. While the discussions below may refer to the playback environment 600 of FIG. 6, it should be appreciated that the playback environment 600 is only one example of a playback environment within which a playback device may be calibrated. Other examples are also possible.

The computing device 610 may receive and store in the database additional information relating to each calibration event. The information may be transmitted by each of the playback devices that is calibrated, whether in playback environment 600 or elsewhere. Additionally or alternatively, the information may be transmitted by a control device involved in the calibration. For instance, a given playback device and/or control device may transmit information regarding the date and time of a given calibration, an identification of the playback device including the make, model, and serial number, which may further indicate the “age” of the playback device (since it was manufactured), the zone configuration of the playback device, such as a grouping or pairing state, the software version of the playback device, the hardware and software version of a given control device. Numerous other examples are also possible.

In addition, the computing device 610 may receive and store results from each calibration in the database. For example, the computing device may store the determined response curve for each playback environment in which a playback device is calibrated, including more specific information such as the approximate room size or the proximity of an obstructive object to the playback device, which may be detected by the microphone. Further, the computing device 610 may receive and store the audio processing algorithm that is implemented by each playback device as a result of each calibration. For instance, the computing device 610 may receive and store the specific filter coefficients that are applied by each playback device. As another example, the computing device 610 may receive and store a difference metric for each calibration, which may include an indication of how significantly the sound calibration of the playback device changed as a result of the calibration. The computing device 610 may also receive and store information regarding failed calibration attempts, including the reason for the failure, if known. Other examples are also possible.

In some embodiments, the computing device 610 may be a server that is maintained and operated by the company that sells the playback devices being calibrated, such as SONOS, Inc. Alternatively, a third party may maintain and operate the server on behalf of the playback device company. In other examples, a company may employ the methods described herein across multiple different types of speaker systems, which may include playback devices that are made and sold by various different companies. For example, the server might be operated by an audio content provider, or audio content curating service, among other possibilities.

The calibration information discussed above may be provided to the computing device 610 in a variety of ways. For example, the calibration data may be transmitted to the server directly in real time, during or immediately following each calibration that takes place. However, if there are a relatively large number of calibrations across many devices, this may create bandwidth issues at the computing device 610. Therefore, each playback device may locally store and update a calibration file containing the data discussed above. The calibration file may then be periodically requested (“pulled”) by the computing device 610, perhaps as part of a playback device diagnostic event or a software update. Alternatively, the calibration file may be transmitted (“pushed”) to the computing device 610 by the playback device as part of a diagnostic event or a software update. The calibration file might also be transmitted in response to a manual commend, such as a user input. Other examples are also possible.

In addition to receiving and storing the calibration data, the computing device 610, may also receive and store information from each playback device that is not necessarily related to an individual calibration event. For example, the computing device 610 may receive and store playback device characteristics such as user account(s) data associated with the playback device (e.g., geography, age, gender, etc.), playback history of the playback device and associated data (e.g., listening duration, audio quality, genres, favorites, media sources, etc.) and any manual settings present on the playback device at the time of calibration, such as a manually adjusted equalization (EQ). Other possibilities also exist.

In some embodiments, a calibration check may be performed on a given playback device. For example, each playback device has a target frequency response curve representing how audio content should sound when played in the particular environment by the playback device, once calibrated. Thus, it may be possible to perform a coarse check to determine whether the current calibration of the playback device is within a certain margin of error of the target frequency response curve. For example, the coarse check may include recording audio content from a calibration tone that is simplified in its content, duration, or both. Alternatively, recorded playback of media content may be used for the coarse check. If the current calibration is not within the margin of error, which may be, for example, 10 percent, it may indicate that a calibration should be performed. The user may then be prompted to calibrate the playback device via the interface 308 of the control device 300. In some examples, periodic calibration checks may be performed on a given playback device, such as once per month.

Further, the computing device 610 may receive and store data that may indicate a level of listener satisfaction with each calibration. For instance, if two calibrations for the same device occur in relatively close proximity to each other, with few other variables changing, it may indicate that the listener was dissatisfied with the first calibration. As another example, the computing device may receive an indication that the audio response of the playback device has been manually adjusted following a calibration, which may again indicate that the calibration did not meet the listener's expectations.

On the other hand, the computing device 610 may receive data indicating an increase playback activity following a calibration. This may indicate increased listener satisfaction with the calibration. Further, if the computing device 610 does not receive any recalibration data, nor any data indicating a manual EQ change, it may imply that the user was satisfied. As another example, a simple survey inquiring whether the listener was satisfied with the calibration, or noticed an improvement, may be provided via an interface of the control device 602. Other measures for determining listener satisfaction with a given calibration also exist.

Finally, although the examples described herein may primarily involve the computing device 610 acting as a centralized server receiving and storing calibration information from numerous playback devices across numerous playback environments, it should be understood that the collection, storage, and transmission of calibration information discussed in all of the examples herein may be carried out within a single playback system, by one or more of the playback devices and controllers of the individual system, either temporarily or for an extended period of time. For example, calibration data may be aggregated for all playback device calibrations within a single playback system, and may be stored among one or more of the playback devices or other devices within the playback system. The aggregated calibration data may then be transmitted to another computing device in communication with multiple playback systems. Other examples are also possible.

a. Playback Device Calibration Based on Representative Spectral Characteristics

As noted above, examples discussed herein involve calibrating a playback device within a playback environment based on representative spectral characteristics. Utilizing representative spectral characteristics may facilitate a playback device calibration process that is, for instance, shorter in duration. Other benefits are also possible.

Methods 500 and 900 shown in FIGS. 5 and 9 present embodiments of methods that can be implemented within an operating environment involving, for example, the media playback system 100 of FIG. 1, one or more of the playback device 200 of FIG. 2, and one or more of the control device 300 of FIG. 3. Methods 500 and 900 may include one or more operations, functions, or actions as illustrated by one or more of blocks 502-510 and 902-910. Although the blocks are illustrated in sequential order, these blocks may also be performed in parallel, and/or in a different order than those described herein. Also, the various blocks may be combined into fewer blocks, divided into additional blocks, and/or removed based upon the desired implementation.

In addition, for the methods 500, 900, and other processes and methods disclosed herein, the flowchart shows functionality and operation of one possible implementation of present embodiments. In this regard, each block may represent a module, a segment, or a portion of program code, which includes one or more instructions executable by a processor for implementing specific logical functions or steps in the process. The program code may be stored on any type of computer readable medium, for example, such as a storage device including a disk or hard drive. The computer readable medium may include non-transitory computer readable medium, for example, such as computer-readable media that stores data for short periods of time like register memory, processor cache and Random Access Memory (RAM). The computer readable medium may also include non-transitory media, such as secondary or persistent long term storage, like read only memory (ROM), optical or magnetic disks, compact-disc read only memory (CD-ROM), for example. The computer readable media may also be any other volatile or non-volatile storage systems. The computer readable medium may be considered a computer readable storage medium, for example, or a tangible storage device. In addition, for the methods 500, 900 and other processes and methods disclosed herein, each block in FIGS. 5 and 9 may represent circuitry that is wired to perform the specific logical functions in the process.

At block 502 of the method 500, a computing device may maintain a database of representative spectral characteristics. The representative spectral characteristics may be based on a plurality of spectral data corresponding to the plurality of playback environments. For instance, the computing device may be the computing device 610 shown in FIG. 6, which may act as a central server in communication with the plurality of playback devices.

FIG. 7 shows another example of the computing device according to some of the examples discussed herein. In FIG. 7, the computing device is shown as the computing device 701, in communication with a particular playback device 702, which is to be calibrated in a particular playback environment 703. The particular playback device 702 may be similar to the playback device 200 shown in FIG. 2, as well as the playback devices 604 and 606 shown in FIG. 6. FIG. 7 also shows a plurality of playback devices 704 a, 704 b, and 704 c, located respectively within different playback environments 705 a, 705 b, and 705 c. Each of these playback devices may also be similar to the playback device 200 shown in FIG. 2.

The computing device 701 may receive a plurality of spectral data corresponding to the plurality of playback environments 705 a, 705 b, and 705 c. The received spectral data may be the result of individual calibrations of the playback devices 704 a, 704 b, and 705 c, in response to which the computing device 701 may have determined an audio processing algorithm for use by each playback device. In each case, the received spectral data may indicate the response curve for the respective playback environment in which each respective playback device was calibrated. Further, the computing device 701 may store the received spectral data, as well as the determined audio processing algorithm, for each calibration.

The spectral data received from each of the plurality of playback devices 704 a, 704 b, and 704 c may be recorded audio playback data. For instance, a given playback device may play a pre-determined calibration tone during its calibration event. The calibration tone may be recorded by a microphone 310 of an associated control device 300, a microphone 220 of the given playback device, or another microphone, and then transmitted to the computing device 701. Additionally or alternatively, the recorded audio playback data may include recorded audio playback of media content. For example, a microphone as discussed above may, during the normal course of media content playback by the given playback device, record the media content playback and then transmit the recorded audio playback data to the computing device 701. Other examples are also possible.

In some instances, each recorded audio playback data in the plurality of recorded audio playback data may have the same duration. In other words, each calibration event, across all of the playback devices in the plurality of playback devices, may include the playback and recording of audio data having the same duration. The duration may be, for example, 45 seconds, although other durations are also possible.

Moreover, the plurality of playback devices shown in FIG. 7 may represent far more playback devices than the three depicted. In some cases, the computing device 701 may receive spectral data corresponding to hundreds of thousands, perhaps even millions, of playback devices located in different playback environments around the world. Based on spectral data received from this relatively large sample of playback environments, the computing device 701 may determine a finite number of representative spectral characteristics that may simplify the scope of possible results for a given calibration, and thus may simplify the calibration process.

For example, in some cases, the computing device 701 may determine that the response curves for all of the calibrated playback environments may be grouped, mathematically, into a finite number of representative response curves. For instance, the computing device 701 may determine that there are 50 representative response curves that capture all, or nearly all, of the received spectral data within a 5% confidence margin. In some cases, extreme outliers may be ignored. As additional examples, the computing device 701 may determine 12 representative response curves, or perhaps 100. As noted above, this finite number of representative spectral characteristics may be used in some future calibrations. The total number of representative spectral characteristics may depend on a combination of the overall distribution of the received spectral data, the level of accuracy desired, among other factors.

As another example, the representative spectral characteristics may be determined based on other data, as an alternative to or in addition to a strictly mathematical analysis. For instance, the representative spectral characteristics may be determined based, in part, on an indication within the data set that, for a given set of the received spectral data, a similar audio processing algorithm applied by each respective playback device tends to provide a similar level of listener satisfaction with the calibration. Listener satisfaction with a given calibration may be determined or approximated as noted above. Other bases for determining the representative spectral data also exist.

Other characteristics corresponding to each playback device 704 a, 704 b, and 705 c may be collected for each calibration as well, in addition to the received spectral data. For example, each calibration may be associated with a given model number, serial number, and/or software version of the calibrated playback device, or of a control device 300 that may be involved in the calibration. The playback device characteristics may also include manual settings of the playback device at the time of calibration, such as a zone configuration or a manually adjusted EQ. The computing device 701 may also store the results of each calibration as well, including the determined response curve for each playback environment and the determined audio processing algorithm. Other examples are also possible.

These additional playback device characteristics may also inform the determination of the representative spectral characteristics as well. For instance, each playback device that is calibrated may have an audio characteristic that is generally known, based on the model of the playback device. The same may be true for the microphones used in the calibration process, whether in a control device or in the playback device. Based this data, the computing device 701 may normalize the received spectral data from each calibration to be independent of the model of playback device or control device involved in the calibration.

FIG. 8 illustrates a portion of an example database 800 maintained by the computing device 701, within which the representative spectral characteristics discussed above may be stored. As shown, the portion of the database 800 may include a plurality of entries 801 a, 801 b, 801 c, and 801 d each representing one of the determined representative spectral characteristics. The representative spectral characteristics may be stored in the database 800 as mathematical representations h_(room) ⁻¹(t)−1, h_(room) ⁻¹(t)−2, h_(room) ⁻¹(t)−3, and h_(room) ⁻¹(t)−4, each describing the spectral characteristic of a representative playback environment.

Further, the portion of the database 800 contains a second column containing entries 802 a, 802 b, 802 c, and 802 d, each representing a playback device characteristic. In the example shown in FIG. 8, the playback device characteristic is a device model, in this case a “PLAY: 3” model playback device for each entry. The plurality of entries 802 a, 802 b, 802 c, and 802 d correspond, respectively, to the entries 801 a, 801 b, 801 c, and 801 d. Other device models are also possible. Moreover, different playback device characteristics, such as zone configuration, for example, are also possible in the second column of the database 800. In each case, the playback device characteristic used may result in different corresponding entries in further columns of the database 800.

For example, a third column in the portion of the example database 800 contains a series of entries 803 a, 803 b, 803 c, and 803 d. Each entry in the third column contains coefficients for an audio processing algorithm determined based on the corresponding representative spectral characteristic and the playback device characteristic in the first two columns. For example, the determined representative spectral characteristic 801 a, in conjunction with the playback device characteristic 802 a, i.e., a “PLAY: 3” model device, may correspond to an audio processing algorithm 803 a represented by coefficients w₁, x₁, y₁, and z₁, and so on for the additional rows in the database 800.

The portion of the database 800 shown in FIG. 8 represents only one example of a database of representative spectral characteristics that the computing device 701 may populate and maintain. In some instances, more than one playback device characteristic may be used, such as device model and zone configuration, in the determination of the audio processing algorithms. Further, the representative spectral characteristics may be stored in a different format or mathematical state (i.e., inverse vs. non-inverse functions). In another example, the audio processing algorithms may be stored as functions and/or equalization functions. Other examples are also possible.

Whatever its format and content, the database 800 of representative spectral characteristics may be used to simplify a calibration procedure for a particular playback device located in a particular playback environment. Returning to FIG. 6, the particular playback device may be, for instance, the playback device 604, which may be similar to the playback device 200 shown in FIG. 2. Further, the particular playback environment may be the playback environment 600 shown in FIG. 6.

At block 504 of the method 500, the computing device 610 may receive particular spectral data associated with the particular playback environment 600, corresponding to the particular playback device 604. For example, the control device 602 shown in FIG. 6 may initiate a calibration procedure, and the playback device 604 may begin playback of a predetermined calibration tone that is recorded by a microphone of the control device 602. Thus, the particular spectral data may include recorded audio playback data. The particular spectral data may additionally or alternatively include recorded audio playback of media content, as discussed above.

As noted above, a goal of capturing audio playback by a playback device during calibration is to determine the spectral characteristic of the playback environment in question. The calibration of the particular playback device 604 in the present example has, by virtue of the database 800, a finite number of known representative spectral characteristics as a potential result. Consequently, the computing device 610 may be able to identify one of the representative spectral characteristics based on relatively less spectral data than some other calibration procedures, which may involve empirically determining the spectral characteristic of a given playback environment “from scratch.”

Accordingly, at block 506, based on the particular spectral data, the computing device 610 may identify one of the representative spectral characteristics from the database 800 that substantially matches the particular spectral data. For instance, the computing device 610 may mathematically determine the best fit for the particular spectral data among the representative spectral characteristics. This may represent a relatively high probability that the particular playback environment 600 corresponds to the identified representative spectral characteristic.

The requirement for less particular spectral data in the calibration of particular playback device 604 may lead to simplification of the calibration process in multiple ways. For instance, as noted above, some calibrations may require 45 seconds of recorded audio playback to determine the spectral characteristic of a given playback environment. However, in the present example, the computing device 610 may require particular spectral data that is shorter in duration. For example, the computing device 610 may be able to identify, with relative accuracy, a representative spectral characteristic that substantially matches the particular playback environment 600 based on only 10 seconds of recorded audio playback from particular playback device 604.

Further, in some cases, the control device 602 might not record the audio playback from the particular playback device consecutively to obtain the necessary spectral data. For instance, rather than recording 10 consecutive seconds, the control device 602 might record brief snippets of audio playback each time the control device 602 is in a different location in the particular playback environment 600, and within a predefined range of the particular playback device 604. These snippets, collectively, may have a relatively short duration, yet they may contain sufficient spectral data for the computing device 610 to identify a representative spectral characteristic.

The particular spectral data may also represent a narrower frequency range than might otherwise be required for to determine a playback environment's audio characteristic. Nonetheless, the computing device 610 may identify, with relative accuracy, one of the representative spectral characteristics in the database 800. The ability of the computing device 610 to identify a representative spectral characteristic based on a narrower frequency range may allow for the reduced duration of recorded audio playback, as noted above. For example, instead of recording a pre-determined calibration tone for its entire duration, which may include an extensive sampling of frequencies, the control device 602 may record, and the computing device 610 may receive, only a first portion of the calibration tone. The first portion of the calibration tone may include a coarse sampling of frequencies that may provide sufficient spectral data for the computing device 610 to make a relatively accurate identification of a representative spectral characteristic. Accordingly, some calibration tones may be arranged in this way, such that the may be used for a either a full duration calibration as discussed above, or a shortened calibration as described in the present example.

In addition, the ability of the computing device 610 to identify a representative spectral characteristic based on less frequency data may facilitate the use of media content to calibrate the particular playback device 604, rather than a calibration tone. For example, while some media content may not contain the spectral range of a full-duration calibration tone, it may contain sufficient spectral data to allow the computing device 610 to identify a representative spectral characteristic for the particular playback environment 600, as noted above. In this way, calibration of the particular playback 604 may be performed during the normal course of listening to media content.

Still further, the particular spectral data may include less spatial data than the spectral data obtained in some other calibration procedures. As discussed above with reference to FIG. 6, some calibrations may involve the control device 602 and it associated microphone being moved throughout the playback environment 600. As shown in FIG. 6, the control device 602 may be moved on the circuitous path 608 from point (a) to point (b), recording the spectral data at different spatial locations within the playback environment 600.

In contrast, calibration of the particular playback device 604 in the present example may not require the same degree of spatial sampling. For instance, the control device 602 may be moved on shorter, direct path from point (a) to point (b) while recording the audio playback data. Further, in some examples the control device 602 may not be moved within the particular playback environment 600 at all. Accordingly, this may facilitate a stationary microphone in the particular playback environment 600 capturing audio playback data during the calibration process. Thus, a microphone located on the particular playback device 604, or perhaps on another playback device, such as the playback device 606, may be used.

The amount of particular spectral data that is required to identify a substantial match with one of the representative spectral characteristic may be determined in a number of ways. For example, the computing device 610 may determine, based on the known distribution of representative spectral characteristics and a known calibration tone to be used, that a minimum of 10 seconds of particular spectral data is required to identify a matching representative spectral characteristic with 95% accuracy.

In other examples, the control device 602, or perhaps a playback device within with particular playback environment 600, may transmit the particular spectral data to the computing device 610 as it is captured during the calibration process. The computing device 610 may then determine, on an ongoing basis, the level of confidence in identifying a representative spectral characteristic that substantially matches the received particular spectral data. When the level of confidence reaches a threshold, such as 95%, the computing device 610 may transmit a message to one or both of the particular playback device 604 and the control device 602 that no more particular spectral data is required, and either the playback or recording, or both, may be discontinued accordingly. Other possibilities also exist.

It is possible that, in some instances, the computing device 610 may determine that the particular spectral data received from the particular playback device 604 does not substantially match any of the representative spectral characteristics in the database 800. For example, the particular playback environment may not be sufficiently similar to other playback environments in which playback devices have been calibrated, and the data obtained. In this situation, the user may be informed, via the control device, that his or her playback environment is unique. Further, the user may be prompted to initiate a calibration procedure “from scratch” that includes a more complete spectral analysis of the particular playback environment. Other examples are also possible.

At block 508, the computing device 610 may identify, in the database 800, an audio processing algorithm based on the identified representative spectral characteristic and at least one characteristic of the particular playback device 604. For instance, the computing device may identify the representative spectral characteristic h_(room) ⁻¹(t)−3, corresponding to entry 801 c in the database 800. Further, the computing device 610 may determine that the particular playback device 604 is a “PLAY: 3” model, corresponding to the entry 802 c in the database 800. Based on this data, the computing device 610 may identify the audio processing algorithm at entry 803 c in the database 800, represented by the coefficients w₃, x₃, y₃, and z₃.

Other examples are also possible. For instance, the database 800 may include other entries corresponding to representative spectral characteristic h_(room) ⁻¹(t)−3 in conjunction with different playback device characteristics, such as an example in which the particular playback device 604 is a different model, such as a “PLAY: 5.” This may result in the computing device 610 identifying a different audio processing algorithm within the database 800. As noted above, multiple playback device characteristics may be used in some cases.

At block 510, the computing device 610 may transmit, to the particular playback device 604, data indicating the identified audio processing algorithm. The particular playback device 604 may then apply the identified audio processing algorithm when playing back audio content in the particular playback environment 600.

In some cases, the computing device 610 may store an indication of the identified audio processing algorithm, which may be associated with the particular playback environment 600. This may allow the computing device 610 to transmit data indicating the identified audio processing algorithm to a second particular playback device corresponding to the particular playback environment 600, such as the playback device 606. In this way, the particular playback environment 600 may be calibrated once for the first particular playback device 604, and then the resulting audio processing algorithm may be transmitted to the second particular playback device 606 that may be currently located within the particular playback environment 600. A similar process may be used for a second playback device that may be later added to the particular playback environment 600.

In some examples, the playback device characteristics of the first and second particular playback devices might not be the same. In this situation, the computing device 610 may, based on the identified representative spectral characteristic and the playback device characteristic of the second particular playback device 606, update the identified audio processing algorithm and then transmit the updated audio processing algorithm to the second particular playback device 606. Alternatively, the computing device 610 may identify a different audio processing algorithm from the database 800. Other examples are also possible.

In another embodiment, the computing device 610 may cause an indication of the identified audio processing algorithm to be stored at the particular playback device 604. For example, the audio processing coefficients w₃, x₃, y₃, and z₃ may be stored in memory on the playback device 604. This may allow the particular playback device 604 to be moved to another playback environment and perhaps recalibrated, and then moved back to the particular playback environment 600. In this situation, the particular playback device 604 may apply the previously identified audio processing algorithm stored in its memory, and thus regain its previous calibration to the particular playback environment 600. Further, the particular playback device 604 may transmit the stored indication of the identified audio processing algorithm to other playback devices located within in the particular playback environment 600. Other examples are also possible.

b. Playback Device Self-Calibration Based on Representative Spectral Characteristics

Combining several of the examples discussed above, an example playback device may perform a self-calibration based on representative spectral characteristics. The playback device may be, for example, the playback device 604 shown in FIG. 6, which may be similar to the playback device 200 shown in FIG. 2. The playback device 604 may include a processor 202, a microphone 220, and a non-transitory computer readable medium, such as memory 206. The memory 206 may include program instructions stored thereon that, when executed by the processor 202, cause the playback device 604 to perform functions such as those depicted in the flowchart 900 shown in FIG. 9.

For example, at block 902 of the method 900, the playback device 604 may play back first audio content in a playback environment, such as the playback environment 600. The first audio content may include a pre-determined calibration tone. Additionally or alternatively, the first audio content may include the playback of media content, as discussed above. For instance, the first audio content may be audio content that that playback device 604 is already playing, irrespective of any indication that a calibration procedure has been initiated.

At block 904, the playback device 604 may record, via the microphone 220, at least a portion of the played back first audio content. At block 906, the playback device 604 may transmit the recorded audio content to a computing device, such as the computing device 610 shown in FIG. 7. Further, the playback device 604 may also transmit playback device characteristic information to the computing device 610 that corresponds to the playback device 604.

The playback device characteristic information may include any of the information and data noted above, such as the date and time of a given calibration, the serial number of the playback device 604, the zone configuration of the playback device 604, the software version of the playback device 604, the hardware and software version of a control device 602 associated with the playback device 604, the user accounts associated with the playback device 604, playback history and associated data (genres, favorites, etc.) and any manual settings present on the playback device 204 at the time of calibration, such as a manually adjusted EQ. Numerous other examples are also possible.

In some examples, the computing device 604 may receive an indication from the computing device 610 that a particular duration of recorded audio content is to be recorded and transmitted. Alternatively, the playback device 604 may record and transmit recorded audio content to the computing device 610 continuously until it receives an indication from the computing device 610 that no further recorded audio content is required. In response, the playback device 604 may discontinue playback of the first audio content, discontinue recording the played back first audio content, or both.

At block 908, the playback device 604 may receive, from the computing device 610, data indicating an audio processing algorithm corresponding to the playback environment 600. The audio processing algorithm may be determined by the computing device 610 based on representative spectral characteristics, as well as one or more of the playback device characteristics transmitted by the playback device 604, according the examples and discussion above.

Further, the computing device 604 may store the received indication of the audio processing algorithm corresponding to the playback environment 600 in memory 206. This may allow the playback device 604 to, for instance, recall the determined audio processing algorithm if it is moved and calibrated elsewhere, and then returned to the playback environment 600. Additionally, the playback device 604 may transmit data indicating the audio processing algorithm to other playback devices in the playback environment 600, such as playback device 606, as further discussed above.

At block 910, the playback device 604 may apply the audio processing algorithm while playing back second audio content in the playback environment 600. This may result in the calibration of the playback device 604 and an improved listening experience. As noted in the examples above, the calibration described may be performed by the playback 604 relatively quickly. In some cases, the calibration may also be relatively unnoticeable to a listener, as it may be performed during the normal course of playing back audio content.

In some examples, the playback device 604 may be configured to perform a self-calibration according to the examples above at a given time. For instance, the playback device 604 may perform a self-calibration every 6 months, or whenever it receives a software update, or whenever its zone configuration is updated. As another example, the playback device 604 may be configured to perform a self-calibration at a certain time of day, on a certain day of the week. For instance, playback device 604 may perform a self-calibration during the middle of the day, when nobody is expected to be present in the playback environment 600. In this situation, the playback device 604 may opt to self-calibrate using a pre-determined calibration tone, which may otherwise be disrupting if listeners are present. Numerous other examples, alone or in combination with those above, are also possible

IV. Conclusion

The description above discloses, among other things, various example systems, methods, apparatus, and articles of manufacture including, among other components, firmware and/or software executed on hardware. It is understood that such examples are merely illustrative and should not be considered as limiting. For example, it is contemplated that any or all of the firmware, hardware, and/or software aspects or components can be embodied exclusively in hardware, exclusively in software, exclusively in firmware, or in any combination of hardware, software, and/or firmware. Accordingly, the examples provided are not the only way(s) to implement such systems, methods, apparatus, and/or articles of manufacture.

As indicated above, the examples involve calibrating a playback device based on representative spectral characteristic data. In one aspect, a method is provided. The method involves maintaining, by a computing device, a database of representative spectral characteristics and receiving, by the computing device, particular spectral data associated with a particular playback environment corresponding to the particular playback device. The method also involves, based on the particular spectral data, identifying one of the representative spectral characteristics from the database that substantially matches the particular spectral data and identifying, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device. Further, the method also involves transmitting, to the particular playback device, data indicating the identified audio processing algorithm.

In another aspect, a computing device is provided. The device includes a processor, a non-transitory computer readable medium, and program instructions stored on the non-transitory computer readable medium that, when executed by the processor, cause the computing device to perform functions. The functions include maintaining a database of representative spectral characteristics and receiving particular spectral data associated with a particular playback environment corresponding to a particular playback device. The functions also include, based on the particular spectral data, identifying one of the representative spectral characteristics from the database that substantially matches the particular spectral data and identifying, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device. Further, the functions include transmitting, to the particular playback device, data indicating the identified audio processing algorithm.

In yet another aspect, a playback device is provided. The device includes a processor, a microphone, a non-transitory computer readable medium, and program instructions stored on the non-transitory computer readable medium that, when executed by the processor, cause the playback device to perform functions. The functions include playing back first audio content in a playback environment and recording, via the microphone, at least a portion of the played back first audio content. The functions also include transmitting the recorded audio content to a computing device and receiving, from the computing device, data indicating an audio processing algorithm corresponding to the playback environment. Further, the functions include applying the audio processing algorithm while playing back second audio content in the playback environment.

Additionally, references herein to “embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one example embodiment of an invention. The appearances of this phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. As such, the embodiments described herein, explicitly and implicitly understood by one skilled in the art, can be combined with other embodiments.

The specification is presented largely in terms of illustrative environments, systems, procedures, steps, logic blocks, processing, and other symbolic representations that directly or indirectly resemble the operations of data processing devices coupled to networks. These process descriptions and representations are typically used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art. Numerous specific details are set forth to provide a thorough understanding of the present disclosure. However, it is understood to those skilled in the art that certain embodiments of the present disclosure can be practiced without certain, specific details. In other instances, well known methods, procedures, components, and circuitry have not been described in detail to avoid unnecessarily obscuring aspects of the embodiments. Accordingly, the scope of the present disclosure is defined by the appended claims rather than the forgoing description of embodiments.

When any of the appended claims are read to cover a purely software and/or firmware implementation, at least one of the elements in at least one example is hereby expressly defined to include a tangible, non-transitory medium such as a memory, DVD, CD, Blu-ray, and so on, storing the software and/or firmware. 

We claim:
 1. Tangible, non-transitory computer readable medium having stored thereon program instructions that, when executed by the processor, cause a computing device to perform functions comprising: maintaining a database of representative spectral characteristics; receiving particular spectral data associated with a particular playback environment corresponding to a particular playback device; based on the particular spectral data, identifying one of the representative spectral characteristics from the database that substantially matches the particular spectral data; identifying, in the database, an audio processing algorithm based on a) the identified representative spectral characteristic and b) at least one characteristic of the particular playback device; and transmitting, to the particular playback device, data indicating the identified audio processing algorithm.
 2. The tangible, non-transitory computer readable medium of claim 1, wherein maintaining the database of representative spectral characteristics comprises: receiving a plurality of spectral data corresponding to a respective plurality of playback environments; and determining, based on the plurality of received spectral data, the representative spectral characteristics.
 3. The tangible, non-transitory computer readable medium of claim 1, wherein each recorded audio playback data in the plurality of spectral data comprises a first duration, and wherein the recorded audio playback data from the particular playback device comprises a second duration that is less than the first duration.
 4. The tangible, non-transitory computer readable medium of claim 1, wherein the particular spectral data comprises recorded audio playback of media content.
 5. The tangible, non-transitory computer readable medium of claim 1, wherein the particular playback device is a first particular playback device, and wherein the program instructions, when executed by the processor, further cause the computing device to perform functions comprising: storing an indication of the identified audio processing algorithm; and transmitting, to a second particular playback device corresponding to the particular playback environment, data indicating the identified audio processing algorithm.
 6. The tangible, non-transitory computer readable medium of claim 1, wherein the program instructions, when executed by the processor, further cause the computing device to perform functions comprising: causing an indication of the identified audio processing algorithm to be stored at the particular playback device.
 7. Tangible, non-transitory computer readable medium having stored thereon program instructions that, when executed by a processor, cause a playback device to perform functions comprising: playing back first audio content in a playback environment; recording, via a microphone of the playback device, at least a portion of the played back first audio content; while recording the played back first audio content, transmitting the recorded first audio content to a computing device; receiving, from the computing device, an indication that no further recorded audio content is required; based on the indication that no further recorded audio content is required, discontinuing recording the played back first audio content; receiving, from the computing device, data indicating an audio processing algorithm corresponding to the playback environment; and applying the audio processing algorithm while playing back second audio content in the playback environment.
 8. The tangible, non-transitory computer readable medium of claim 7, wherein playing back the first audio content comprises playing back a pre-determined calibration tone.
 9. The tangible, non-transitory computer readable medium of claim 7, wherein playing back the first audio content comprises playing back media content.
 10. The tangible, non-transitory computer readable medium of claim 7, wherein transmitting the recorded audio content to the computing device further comprises transmitting, to the computing device, playback device characteristic information corresponding to the playback device.
 11. The tangible, non-transitory computer readable medium of claim 7, wherein the program instructions, when executed by the processor, further cause the playback device to perform functions comprising: storing an indication of the audio processing algorithm corresponding to the playback environment in memory.
 12. The tangible, non-transitory computer readable medium of claim 7, wherein the first audio content and the second audio content are different portions of the same media content.
 13. The tangible, non-transitory computer readable medium of claim 7, wherein the audio processing algorithm is a first audio processing algorithm, and wherein the functions further comprise: at a predetermined duration of time after applying the first audio processing algorithm, playing back third audio content in the playback environment; recording, via the microphone of the playback device, at least a portion of the played back third audio content; while recording the played back third audio content, transmitting the recorded third audio content to a computing device; receiving, from the computing device, an indication that no further recorded audio content is required; based on the indication that no further recorded audio content is required, discontinuing recording the played back third audio content; receiving, from the computing device, data indicating a second audio processing algorithm corresponding to the playback environment; and applying the second audio processing algorithm while playing back fourth audio content in the playback environment.
 14. A method comprising: playing back, by a playback device, first audio content in a playback environment; recording, via a microphone of the playback device, at least a portion of the played back first audio content; while recording the played back first audio content, transmitting by the playback device to a computing device, the recorded first audio content; receiving, by the playback device from the computing device, an indication that no further recorded audio content is required; based on the indication that no further recorded audio content is required, discontinuing recording the played back first audio content; receiving, by the playback device from the computing device, data indicating an audio processing algorithm corresponding to the playback environment; and applying by the playback device, the audio processing algorithm while playing back second audio content in the playback environment.
 15. The method of claim 14, wherein playing back by the playback device, the first audio content comprises playing back by the playback device, a pre-determined calibration tone.
 16. The method of claim 14, wherein playing back by the playback device, the first audio content comprises playing back by the playback device, media content.
 17. The method of claim 14, wherein transmitting by the playback device to the computing device, the recorded audio content, further comprises transmitting, by the playback device to the computing device, playback device characteristic information corresponding to the playback device.
 18. The method of claim 14, further comprising: storing by the playback device, an indication of the audio processing algorithm corresponding to the playback environment in memory.
 19. The method of claim 14, wherein the first audio content and the second audio content are different portions of the same media content.
 20. The method of claim 14, wherein the audio processing algorithm is a first audio processing algorithm, further comprising: at a predetermined duration of time after applying by the playback device, the first audio processing algorithm, playing back by the playback device, third audio content in the playback environment; recording, via the microphone of the playback device, at least a portion of the played back third audio content; while recording the played back third audio content, transmitting by the playback device to the computing device, the recorded third audio content; receiving, by the playback device from the computing device, an indication that no further recorded audio content is required; based on the indication that no further recorded audio content is required, discontinuing recording the played back third audio content; receiving, by the playback device from the computing device, data indicating a second audio processing algorithm corresponding to the playback environment; and applying by the playback device, the second audio processing algorithm while playing back fourth audio content in the playback environment. 